SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc., Hsei, Vanessa
Koumenis, Iphigenia
Leong, Steven R.
Presta, Leonard G.
Shahrokh, Zahra
Zapata, Gerardo A.

(ii) TITLE OF INVENTION: ANTIBODY FRAGMENT-POLYMER CONJUGATES
AND HUMANIZED ANTI-IL-8 MONOCLONAL ANTIBODIES

(iii) NUMBER OF SEQUENCES: 72

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 1 DNA Way 

(C) CITY: South San Francisco

(D) STATE: California

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: WinPatin (Genentech)

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 20-Jan-1999 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/074330 

(B) FILING DATE: 22-JAN-1998

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 60/094003

(B) FILING DATE: 24-JUL-1998 

(vii) PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER: 60/094013

(B) FILING DATE: 24-JUL-1998

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 60/075467
(B) FILING DATE: 20-FEB-1998

(viii) ATTORNEY/AGENT INFORMATION:
(B) REGISTRATION NUMBER: 34,659

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 650/225-5530 

(B) TELEFAX: 650/952-9881 
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



CAGTCCAACT GTTCAGGACG CC 22 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GTGCTGCTCA TGCTGTAGGT GC 22 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



GAAGTTGATG TCTTGTGAGT GGC 23 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



GCATCCTAGA GTCACCGAGG AGCC 24 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CACTGGCTCA GGGAAATAAC CC 22

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GGAGAGCTGG GAAGGTGTGC AC 22

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ACAAACGCGT ACGCTGACAT CGTCATGACC CAGTC 35
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ACAAACGCGT ACGCTGATAT TGTCATGACT CAGTC 35

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



ACAAACGCGT ACGCTGACAT CGTCATGACA CAGTC 35 
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

GCTCTTCGAA TGGTGGGAAG ATGGATACAG TTGGTGC 37

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

CGATGGGCCC GGATAGACCG ATGGGGCTGT TGTTTTGGC 39
(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CGATGGGCCC GGATAGACTG ATGGGGCTGT CGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

CGATGGGCCC GGATAGACGG ATGGGGCTGT TGTTTTGGC 39
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

CGATGGGCCC GGATAGACAG ATGGGGCTGT TGTTTTGGC 39
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CGATGGGCCC GGATAGACTG ATGGGGCTGT TGTTTTGGC 39 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 369 base pairs 
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GACATTGTCA TGACACAGTC TCAAAAATTC ATGTCCACAT CAGTAGGAGA 50
CAGGGTCAGC GTCACCTGCA AGGCCAGTCA GAATGTGGGT ACTAATGTAG 100
CCTGGTATCA ACAGAAACCA GGGCAATCTC CTAAAGCACT GATTTACTCG 150
TCATCCTACC GGTACAGTGG AGTCCCTGAT CGCTTCACAG GCAGTGGATC 200
TGGGACAGAT TTCACTCTCA CCATCAGCCA TGTGCAGTCT GAAGACTTGG 250
CAGACTATTT CTGTCAGCAA TATAACATCT ATCCTCTCAC GTTCGGTCCT 300
GGGACCAAGC TGGAGTTGAA ACGGGCTGAT GCTGCACCAC CAACTGTATC 350
CATCTTCCCA CCATTCGAA 369

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asp Ile Val Met Thr Gln Ser Gln Lys Phe Met Ser Thr Ser Val
1 5 10 15

Gly Asp Arg Val Ser Val Thr Cys Lys Ala Ser Gln Asn Val Gly
20 25 30

Thr Asn Val Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ser Pro Lys
35 40 45

Ala Leu Ile Tyr Ser Ser Ser Tyr Arg Tyr Ser Gly Val Pro Asp
50 55 60

Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile
65 70 75

Ser His Val Gln Ser Glu Asp Leu Ala Asp Tyr Phe Cys Gln Gln
80 85 90

Tyr Asn Ile Tyr Pro Leu Thr Phe Gly Pro Gly Thr Lys Leu Glu
95 100 105

Leu Lys Arg Ala Asp Ala Ala Pro Pro Thr Val Ser Ile Phe Pro
110 115 120

Pro Phe Glu
123

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TTCTATTGCT ACAAACGCGT ACGCTGAGGT GCAGCTGGTG GAGTCTGGGG 50 
GAGGCTTAGT GCCGCCTGGA GGGTCCCTGA AACTCTCCTG TGCAGCCTCT 100 
GGATTCATAT TCAGTAGTTA TGGCATGTCT TGGGTTCGCC AGACTCCAGG 150 
CAAGAGCCTG GAGTTGGTCG CAACCATTAA TAATAATGGT GATAGCACCT 200
ATTATCCAGA CAGTGTGAAG GGCCGATTCA CCATCTCCCG AGACAATGCC 250
AAGAACACCC TGTACCTGCA AATGAGCAGT CTGAAGTCTG AGGACACAGC 300 
CATGTTTTAC TGTGCAAGAG CCCTCATTAG TTCGGCTACT TGGTTTGGTT 350 
ACTGGGGCCA AGGGACTCTG GTCACTGTCT CTGCAGCCAA AACAACAGCC 400 
CCATCTGTCT ATCCGGG 417 
(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Pro Pro Gly
1 5 10 15

Gly Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Ile Phe Ser
20 25 30

Ser Tyr Gly Met Ser Trp Val Arg Gln Thr Pro Gly Lys Ser Leu
35 40 45

Glu Leu Val Ala Thr Ile Asn Asn Asn Gly Asp Ser Thr Tyr Tyr
50 55 60

Pro Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala
65 70 75

Lys Asn Thr Leu Tyr Leu Gln Met Ser Ser Leu Lys Ser Glu Asp
80 85 90

Thr Ala Met Phe Tyr Cys Ala Arg Ala Leu Ile Ser Ser Ala Thr
95 100 105

Trp Phe Gly Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ala
110 115 120

Ala Lys Thr Thr Ala Pro Ser Val Tyr Pro
125 130

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

ACAAACGCGT ACGCTGATAT CGTCATGACA G 31
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GCAGCATCAG CTCTTCGAAG CTCCAGCTTG G 31

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs

(B) TYPE: DNA 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:



CCACTAGTAC GCAAGTTCAC G 21 
(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

GATGGGCCCT TGGTGGAGGC TGCAGAGACA GTG 33
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 714 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCGTACGCTG ATATCGTCAT GACACAGTCT CAAAAATTCA 100
TGTCCACATC AGTAGGAGAC AGGGTCAGCG TCACCTGCAA GGCCAGTCAG 150
AATGTGGGTA CTAATGTAGC CTGGTATCAA CAGAAACCAG GGCAATCTCC 200
TAAAGCACTG ATTTACTCGT CATCCTACCG GTACAGTGGA GTCCCTGATC 250
GCTTCACAGG CAGTGGATCT GGGACAGATT TCACTCTCAC CATCAGCCAT 300
GTGCAGTCTG AAGACTTGGC AGACTATTTC TGTCAGCAAT ATAACATCTA 350
TCCTCTCACG TTCGGTCCTG GGACCAAGCT GGAGCTTCGA AGAGCTGTGG 400
CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450
GGAACTGCTT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500
CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550
AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600
ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650
CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700
GGGGAGAGTG TTAA 714
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 237 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Val Met Thr Gln Ser
20 25 30

Gln Lys Phe Met Ser Thr Ser Val Gly Asp Arg Val Ser Val Thr
35 40 45

Cys Lys Ala Ser Gln Asn Val Gly Thr Asn Val Ala Trp Tyr Gln
50 55 60

Gln Lys Pro Gly Gln Ser Pro Lys Ala Leu Ile Tyr Ser Ser Ser
65 70 75

Tyr Arg Tyr Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Ser
80 85 90

Gly Thr Asp Phe Thr Leu Thr Ile Ser His Val Gln Ser Glu Asp
95 100 105

Leu Ala Asp Tyr Phe Cys Gln Gln Tyr Asn Ile Tyr Pro Leu Thr
110 115 120

Phe Gly Pro Gly Thr Lys Leu Glu Leu Arg Arg Ala Val Ala Ala
125 130 135

Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
140 145 150

Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
155 160 165

Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly
170 175 180

Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr
185 190 195

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu
200 205 210

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser
215 220 225

Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
230 235 237

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 756 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Double
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCGTACGCTG AGGTGCAGCT GGTGGAGTCT GGGGGAGGCT 100
TAGTGCCGCC TGGAGGGTCC CTGAAACTCT CCTGTGCAGC CTCTGGATTC 150
ATATTCAGTA GTTATGGCAT GTCTTGGGTT CGCCAGACTC CAGGCAAGAG 200
CCTGGAGTTG GTCGCAACCA TTAATAATAA TGGTGATAGC ACCTATTATC 250
CAGACAGTGT GAAGGGCCGA TTCACCATCT CCCGAGACAA TGCCAAGAAC 300
ACCCTGTACC TGCAAATGAG CAGTCTGAAG TCTGAGGACA CAGCCATGTT 350
TTACTGTGCA AGAGCCCTCA TTAGTTCGGC TACTTGGTTT GGTTACTGGG 400
GCCAAGGGAC TCTGGTCACT GTCTCTGCAG CCTCCACCAA GGGCCCATCG 450
GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG GCACAGCGGC 500
CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 550
GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA 600
CAGTCCTCAG GACTCTACTC CCTCAGCAGC GTGGTGACCG TGCCCTCCAG 650
CAGCTTGGGC ACCCAGACCT ACATCTGCAA CGTGAATCAC AAGCCCAGCA 700
ACACCAAGGT GGACAAGAAA GTTGAGCCCA AATCTTGTGA CAAAACTCAC 750
ACATGA 756

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 251 amino acids
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Glu Ser
20 25 30

Gly Gly Gly Leu Val Pro Pro Gly Gly Ser Leu Lys Leu Ser Cys
35 40 45

Ala Ala Ser Gly Phe Ile Phe Ser Ser Tyr Gly Met Ser Trp Val
50 55 60

Arg Gln Thr Pro Gly Lys Ser Leu Glu Leu Val Ala Thr Ile Asn
65 70 75

Asn Asn Gly Asp Ser Thr Tyr Tyr Pro Asp Ser Val Lys Gly Arg
80 85 90

Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr Leu Gln
95 100 105

Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Met Phe Tyr Cys Ala
110 115 120

Arg Ala Leu Ile Ser Ser Ala Thr Trp Phe Gly Tyr Trp Gly Gln
125 130 135

Gly Thr Leu Val Thr Val Ser Ala Ala Ser Thr Lys Gly Pro Ser
140 145 150

Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
155 160 165

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val
170 175 180

Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr
185 190 195

Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser
200 205 210

Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile
215 220 225

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys
230 235 240

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 251

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

CCAATGCATA CGCTGACATC GTGATGACCC AGACCCC 37
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



CCAATGCATA CGCTGATATT GTGATGACTC AGACTCC 37 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CCAATGCATA CGCTGACATC GTGATGACAC AGACACC 37 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AGATGTCAAT TGCTCACTGG ATGGTGGGAA GATGG 35 
(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

CAAACGCGTA CGCTGAGATC CAGCTGCAGC AG 32 
(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CAAACGCGTA CGCTGAGATT CAGCTCCAGC AG 32

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 391 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

GATATCGTGA TGACACAGAC ACCACTCTCC CTGCCTGTCA GTCTTGGAGA 50
TCAGGCCTCC ATCTCTTGCA GATCTAGTCA GAGCCTTGTA CACGGTATTG 100
GAAACACCTA TTTACATTGG TACCTGCAGA AGCCAGGCCA GTCTCCAAAG 150
CTCCTGATCT ACAAAGTTTC CAACCGATTT TCTGGGGTCC CAGACAGGTT 200 
CAGTGGCAGT GGATCAGGGA CAGATTTCAC ACTCAGGATC AGCAGAGTGG 250 
AGGCTGAGGA TCTGGGACTT TATTTCTGCT CTCAAAGTAC ACATGTTCCG 300 
CTCACGTTCG GTGCTGGGAC CAAGCTGGAG CTGAAACGGG CTGATGCTGC 350 
ACCAACTGTA TCCATCTTCC CACCATCCAG TGAGCAATTG A 391 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 131 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Asp Ile Val Met Thr Gln Thr Pro Leu Ser Leu Pro Val Ser Leu
1 5 10 15

Gly Asp Gln Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Asn Thr Tyr Leu His Trp Tyr Leu Gln Lys Pro
35 40 45

Gly Gln Ser Pro Lys Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe
50 55 60

Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp
65 70 75

Phe Thr Leu Arg Ile Ser Arg Val Glu Ala Glu Asp Leu Gly Leu
80 85 90

Tyr Phe Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly Ala
95 100 105

Gly Thr Lys Leu Glu Leu Lys Arg Ala Asp Ala Ala Pro Thr Val
110 115 120

Ser Ile Phe Pro Pro Ser Ser Glu Gln Leu Lys
125 130 131

(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

GAGATTCAGC TGCAGCAGTC TGGACCTGAG CTGATGAAGC CTGGGGCTTC 50
AGTGAAGATA TCCTGCAAGG CTTCTGGTTA TTCATTCAGT AGCCACTACA 100
TGCACTGGGT GAAGCAGAGC CATGGAAAGA GCCTTGAGTG GATTGGCTAC 150
ATTGATCCTT CCAATGGTGA AACTACTTAC AACCAGAAAT TCAAGGGCAA 200
GGCCACATTG ACTGTAGACA CATCTTCCAG CACAGCCAAC GTGCATCTCA 250
GCAGCCTGAC ATCTGATGAC TCTGCAGTCT ATTTCTGTGC AAGAGGGGAC 300
TATAGATACA ACGGCGACTG GTTTTTCGAT GTCTGGGGCG CAGGGACCAC 350
GGTCACCGTC TCCTCCGCCA AAACCGACAG CCCCATCGGT CTATCCGGGC 400
CCATC 405

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 135 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Glu Ile Gln Leu Gln Gln Ser Gly Pro Glu Leu Met Lys Pro Gly
1 5 10 15

Ala Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Lys Gln Ser His Gly Lys Ser Leu
35 40 45

Glu Trp Ile Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser
65 70 75

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp
80 85 90

Ser Ala Val Tyr Phe Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly
95 100 105

Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr Thr Val Thr Val
110 115 120

Ser Ser Ala Lys Thr Asp Ser Pro Ile Gly Leu Ser Gly Pro Ile
125 130 135

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:



CTTGGTGGAG GCGGAGGAGA CG 22 
(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

GAAACGGGCT GTTGCTGCAC CAACTGTATT CATCTTCC 38
(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

GTCACCGTCT CCTCCGCCTC CACCAAGGGC C 31

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 729 base pairs
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAT GCATACGCTG ATATCGTGAT GACACAGACA CCACTCTCCC 100
TGCCTGTCAG TCTTGGAGAT CAGGCCTCCA TCTCTTGCAG ATCTAGTCAG 150
AGCCTTGTAC ACGGTATTGG AAACACCTAT TTACATTGGT ACCTGCAGAA 200
GCCAGGCCAG TCTCCAAAGC TCCTGATCTA CAAAGTTTCC AACCGATTTT 250
CTGGGGTCCC AGACAGGTTC AGTGGCAGTG GATCAGGGAC AGATTTCACA 300
CTCAGGATCA GCAGAGTGGA GGCTGAGGAT CTGGGACTTT ATTTCTGCTC 350
TCAAAGTACA CATGTTCCGC TCACGTTCGG TGCTGGGACC AAGCTGGAGC 400
TGAAACGGGC TGTTGCTGCA CCAACTGTAT TCATCTTCCC ACCATCCAGT 450
GAGCAATTGA AATCTGGAAC TGCCTCTGTT GTGTGCCTGC TGAATAACTT 500
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700
CAAAGAGCTT CAACAGGGGA GAGTGTTAA 729
(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 242 amino acids
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Val Met Thr Gln Thr
20 25 30

Pro Leu Ser Leu Pro Val Ser Leu Gly Asp Gln Ala Ser Ile Ser
35 40 45

Cys Arg Ser Ser Gln Ser Leu Val His Gly Ile Gly Asn Thr Tyr
50 55 60

Leu His Trp Tyr Leu Gln Lys Pro Gly Gln Ser Pro Lys Leu Leu
65 70 75

Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Asp Arg Phe
80 85 90

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Arg Ile Ser Arg
95 100 105

Val Glu Ala Glu Asp Leu Gly Leu Tyr Phe Cys Ser Gln Ser Thr
110 115 120

Patent Docket No. P1085R4-1 A

His Val Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys
125 130 135

Arg Ala Val Ala Ala Pro Thr Val Phe Ile Phe Pro Pro Ser Ser
140 145 150

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn
155 160 165

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn
170 175 180

Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp
185 190 195

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser
200 205 210

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr
215 220 225

His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly
230 235 240

Glu Cys
242

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 762 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 





ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCGTACGCTG AGATTCAGCT GCAGCAGTCT GGACCTGAGC 100
TGATGAAGCC TGGGGCTTCA GTGAAGATAT CCTGCAAGGC TTCTGGTTAT 150
TCATTCAGTA GCCACTACAT GCACTGGGTG AAGCAGAGCC ATGGAAAGAG 200
CCTTGAGTGG ATTGGCTACA TTGATCCTTC CAATGGTGAA ACTACTTACA 250
ACCAGAAATT CAAGGGCAAG GCCACATTGA CTGTAGACAC ATCTTCCAGC 300
ACAGCCAACG TGCATCTCAG CAGCCTGACA TCTGATGACT CTGCAGTCTA 350
TTTCTGTGCA AGAGGGGACT ATAGATACAA CGGCGACTGG TTTTTCGATG 400
TCTGGGGCGC AGGGACCACG GTCACCGTCT CCTCCGCCTC CACCAAGGGC 450
CCATCGGTCT TCCCCCTGGC ACCCTCCTCC AAGAGCACCT CTGGGGGCAC 500
AGCGGCCCTG GGCTGCCTGG TCAAGGACTA CTTCCCCGAA CCGGTGACGG 550
TGTCGTGGAA CTCAGGCGCC CTGACCAGCG GCGTGCACAC CTTCCCGGCT 600
GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TGACCGTGCC 650
CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC 700
CCAGCAACAC CAAGGTGGAC AAGAAAGTTG AGCCCAAATC TTGTGACAAA 750
ACTCACACAT GA 762

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 253 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Ile Gln Leu Gln Gln Ser
20 25 30

Gly Pro Glu Leu Met Lys Pro Gly Ala Ser Val Lys Ile Ser Cys
35 40 45

Lys Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Lys Gln Ser His Gly Lys Ser Leu Glu Trp Ile Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Lys
80 85 90

Ala Thr Leu Thr Val Asp Thr Ser Ser Ser Thr Ala Asn Val His
95 100 105

Leu Ser Ser Leu Thr Ser Asp Asp Ser Ala Val Tyr Phe Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 253

(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Asp Ile Val Met Thr Gln Thr Pro Leu Ser Leu Pro Val Ser Leu
1 5 10 15

Gly Asp Gln Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Asn Thr Tyr Leu His Trp Tyr Leu Gln Lys Pro
35 40 45

Gly Gln Ser Pro Lys Leu Leu Ile Tyr Tyr Lys Val Ser Asn Arg
50 55 60

Phe Ser Gly Val Pro Asp Arg Phe Ser Asp Ser Gly Ser Gly Thr
65 70 75

Asp Phe Thr Leu Arg Ile Ser Arg Val Glu Ala Glu Asp Leu Gly
80 85 90

Leu Tyr Phe Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly
95 100 105

Ala Gly Thr Lys Leu Glu Leu Lys Arg
110 114

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 114 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
1 5 10 15

Gly Asp Arg Val Thr Ile Thr Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Asn Thr Tyr Leu His Trp Tyr Gln Gln Lys Pro
35 40 45

Gly Lys Ala Pro Lys Leu Leu Ile Tyr Tyr Lys Val Ser Asn Arg
50 55 60

Phe Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr
65 70 75

Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala
80 85 90

Thr Tyr Tyr Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly
95 100 105

Gln Gly Thr Lys Val Glu Ile Lys Arg
110 114

(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 109 amino acids
(B) TYPE: PRT

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
1 5 10 15

Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Lys Thr Ile Ser
20 25 30

Lys Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys
35 40 45

Leu Leu Ile Tyr Tyr Ser Gly Ser Thr Leu Glu Ser Gly Val Pro
50 55 60

Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr
65 70 75

Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln
80 85 90

Gln His Asn Glu Tyr Pro Leu Thr Phe Gly Gln Gly Thr Lys Val
95 100 105

Glu Ile Lys Arg
109

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Glu Ile Gln Leu Gln Gln Ser Gly Pro Glu Leu Met Lys Pro Gly
1 5 10 15

Ala Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Lys Gln Ser His Gly Lys Ser Leu
35 40 45

Glu Trp Ile Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser
65 70 75

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp
80 85 90

Ser Ala Val Tyr Phe Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn
95 100 105

Gly Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr
110 115 117

(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
1 5 10 15

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
35 40 45

Glu Trp Val Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser
65 70 75

Lys Asn Thr Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp
80 85 90

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn
95 100 105

Gly Asp Trp Phe Phe Asp Val Trp Gly Gln Gly Thr
110 115 117

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 116 amino acids 

(B) TYPE: PRT 

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
1 5 10 15

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Thr
20 25 30

Gly His Trp Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
35 40 45

Glu Trp Val Gly Met Ile His Pro Ser Asp Ser Glu Thr Arg Tyr
50 55 60

Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser
65 70 75

Lys Asn Thr Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp
80 85 90

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly Ile Tyr Phe Tyr Gly
95 100 105

Thr Thr Tyr Phe Asp Tyr Trp Gly Gln Gly Thr
110 115 116



(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 242 amino acids
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 

5 1 5 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 

20 25 30 

10 Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 

35 40 45 



15 



30 



45 



Cys Arg Ser Ser Gin Ser Leu Val His Gly He Gly Asn Thr Tyr 

50 55 60 

Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 

65 70 75 



He Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

20 80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser 

95 100 105 

25 Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 

His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 

125 130 135 



Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 
140 145 150 



Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

35 " 155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 

170 175 180 

40 Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 



Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 



His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
50 230 235 240 

Glu Cys 
242 

(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 253 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Gln Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 253

(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 159 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr Glu Lys Met
1 5 10 15

Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala Asp Glu Asn
20 25 30

Ala Leu Gln Ser Asp Ala Lys Gly Lys Leu Asp Ser Val Ala Thr
35 40 45

Asp Tyr Gly Ala Ala Ile Asp Gly Phe Ile Gly Asp Val Ser Gly
50 55 60

Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala Gly Ser Ser
65 70 75

Asn Ser Gln Met Ala Gln Val Gly Asp Gly Asp Asn Ser Pro Leu
80 85 90

Met Asn Asn Phe Arg Gln Tyr Leu Pro Ser Leu Pro Gln Ser Val
95 100 105

Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr Glu Phe
110 115 120

Ser Ile Asp Cys Asp Lys Ile Asn Leu Phe Arg Gly Val Phe Ala
125 130 135

Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser Thr Phe
140 145 150

Ala Asn Ile Leu Arg Asn Lys Glu Ser
155 159

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 780 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150
AGCTTAGTAC ATGGTATAGG TAACACGTAT TTACACTGGT ATCAACAGAA 200
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400
TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700
CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780

(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 253 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Glu Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Lys Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 253



(2) INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser
20 25 30

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr
35 40 45

Cys Arg Ser Ser Gln Ser Leu Val His Gly Ile Gly Ala Thr Tyr
50 55 60

Leu His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
65 70 75

Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe
80 85 90

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
95 100 105

Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gln Ser Thr
110 115 120

His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
125 130 135

Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp
140 145 150

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn
155 160 165

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn
170 175 180

Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp
185 190 195

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser
200 205 210

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr
215 220 225

His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly
230 235 240

Glu Cys
242

(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys
1 5 10 15

Gln Leu Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His
20 25 30

Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg
35 40 45



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150
AGCTTAGTAC ATGGTATAGG TGCTACGTAT TTACACTGGT ATCAACAGAA 200
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400
TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700
CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780

(2) INFORMATION FOR SEQ ID NO: 59:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE: Nucleic Acid 
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 50
TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 100
AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 150
CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 200
GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250
TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300
TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 400
ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 450
GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500
ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 550
TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 650
CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700
AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750
AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCGTG 800
CCCAGCACCA GAACTGCTGG GCGGCCGCAT GAAACAGCTA GAGGACAAGG 850
TCGAAGAGCT ACTCTCCAAG AACTACCACC TAGAGAATGA AGTGGCAAGA 900
CTCAAAAAGC TTGTCGGGGA GCGCTAA 927

(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 298 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Gln Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
245 250 255

Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys Gln Leu
260 265 270

Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu
275 280 285

Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg
290 295 298

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6563 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50
TCATTGCTGA GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAT 100
GAACTGTGTG CGCAGGTAGA AGCTTTGGAG ATTATCGTCA CTGCAATGCT 150
TCGCAATATG GCGCAAAATG ACCAACAGCG GTTGATTGAT CAGGTAGAGG 200
GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA CGACGATACG 250
GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA 300
AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT 350
ATAGTCGCTT TGTTTTTATT TTTTAATGTA TTTGTAACTA GAATTCGAGC 400
TCGGTACCCG GGGATCCTCT CGAGGTTGAG GTGATTTTAT GAAAAAGAAT 450
ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG CTACAAACGC 500
ATACGCTGAT ATCCAGATGA CCCAGTCCCC GAGCTCCCTG TCCGCCTCTG 550
TGGGCGATAG GGTCACCATC ACCTGCAGGT CAAGTCAAAG CTTAGTACAT 600
GGTATAGGTG CTACGTATTT ACACTGGTAT CAACAGAAAC CAGGAAAAGC 650
TCCGAAACTA CTGATTTACA AAGTATCCAA TCGATTCTCT GGAGTCCCTT 700
CTCGCTTCTC TGGATCCGGT TCTGGGACGG ATTTCACTCT GACCATCAGC 750
AGTCTGCAGC CAGAAGACTT CGCAACTTAT TACTGTTCAC AGAGTACTCA 800
TGTCCCGCTC ACGTTTGGAC AGGGTACCAA GGTGGAGATC AAACGAACTG 850
TGGCTGCACC ATCTGTCTTC ATCTTCCCGC CATCTGATGA GCAGTTGAAA 900
TCTGGAACTG CTTCTGTTGT GTGCCTGCTG AATAACTTCT ATCCCAGAGA 950
GGCCAAAGTA CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC 1000
AGGAGAGTGT CACAGAGCAG GACAGCAAGG ACAGCACCTA CAGCCTCAGC 1050
AGCACCCTGA CGCTGAGCAA AGCAGACTAC GAGAAACACA AAGTCTACGC 1100
CTGCGAAGTC ACCCATCAGG GCCTGAGCTC GCCCGTCACA AAGAGCTTCA 1150
ACAGGGGAGA GTGTTAAGCT GATCCTCTAC GCCGGACGCA TCGTGGCCCT 1200
AGTACGCAAC TAGTCGTAAA AAGGGTATCT AGAGGTTGAG GTGATTTTAT 1250
GAAAAAGAAT ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG 1300
CTACAAACGC GTACGCTGAG GTTCAGCTAG TGCAGTCTGG CGGTGGCCTG 1350
GTGCAGCCAG GGGGCTCACT CCGTTTGTCC TGTGCAGCTT CTGGCTACTC 1400
CTTCTCGAGT CACTATATGC ACTGGGTCCG TCAGGCCCCG GGTAAGGGCC 1450
TGGAATGGGT TGGATATATT GATCCTTCCA ATGGTGAAAC TACGTATAAT 1500
CAAAAGTTCA AGGGCCGTTT CACTTTATCT CGCGACAACT CCAAAAACAC 1550
AGCATACCTG CAGATGAACA GCCTGCGTGC TGAGGACACT GCCGTCTATT 1600
ACTGTGCAAG AGGGGATTAT CGCTACAATG GTGACTGGTT CTTCGACGTC 1650
TGGGGTCAAG GAACCCTGGT CACCGTCTCC TCGGCCTCCA CCAAGGGCCC 1700
ATCGGTCTTC CCCCTGGCAC CCTCCTCCAA GAGCACCTCT GGGGGCACAG 1750
CGGCCCTGGG CTGCCTGGTC AAGGACTACT TCCCCGAACC GGTGACGGTG 1800
TCGTGGAACT CAGGCGCCCT GACCAGCGGC GTGCACACCT TCCCGGCTGT 1850
CCTACAGTCC TCAGGACTCT ACTCCCTCAG CAGCGTGGTG ACCGTGCCCT 1900
CCAGCAGCTT GGGCACCCAG ACCTACATCT GCAACGTGAA TCACAAGCCC 1950
AGCAACACCA AGGTCGACAA GAAAGTTGAG CCCAAATCTT GTGACAAAAC 2000
TCACACATGC CCGCCGTGCC CAGCACCAGA ACTGCTGGGC GGCCGCATGA 2050
AACAGCTAGA GGACAAGGTC GAAGAGCTAC TCTCCAAGAA CTACCACCTA 2100
GAGAATGAAG TGGCAAGACT CAAAAAGCTT GTCGGGGAGC GCTAAGCATG 2150
CGACGGCCCT AGAGTCCCTA ACGCTCGGTT GCCGCCGGGC GTTTTTTATT 2200
GTTAACTCAT GTTTGACAGC TTATCATCGA TAAGCTTTAA TGCGGTAGTT 2250
TATCACAGTT AAATTGCTAA CGCAGTCAGG CACCGTGTAT GAAATCTAAC 2300
AATGCGCTCA TCGTCATCCT CGGCACCGTC ACCCTGGATG CTGTAGGCAT 2350
AGGCTTGGTT ATGCCGGTAC TGCCGGGCCT CTTGCGGGAT ATCGTCCATT 2400
CCGACAGCAT CGCCAGTCAC TATGGCGTGC TGCTAGCGCT ATATGCGTTG 2450
ATGCAATTTC TATGCGCACC CGTTCTCGGA GCACTGTCCG ACCGCTTTGG 2500
CCGCCGCCCA GTCCTGCTCG CTTCGCTACT TGGAGCCACT ATCGACTACG 2550
CGATCATGGC GACCACACCC GTCCTGTGGA TCCTCTACGC CGGACGCATC 2600
GTGGCCGGCA TCACCGGCGC CACAGGTGCG GTTGCTGGCG CCTATATCGC 2650
CGACATCACC GATGGGGAAG ATCGGGCTCG CCACTTCGGG CTCATGAGCG 2700
CTTGTTTCGG CGTGGGTATG GTGGCAGGCC CCGTGGCCGG GGGACTGTTG 2750
GGCGCCATCT CCTTGCACGC ACCATTCCTT GCGGCGGCGG TGCTCAACGG 2800
CCTCAACCTA CTACTGGGCT GCTTCCTAAT GCAGGAGTCG CATAAGGGAG 2850
AGCGTCGTCC GATGCCCTTG AGAGCCTTCA ACCCAGTCAG CTCCTTCCGG 2900
TGGGCGCGGG GCATGACTAT CGTCGCCGCA CTTATGACTG TCTTCTTTAT 2950
CATGCAACTC GTAGGACAGG TGCCGGCAGC GCTCTGGGTC ATTTTCGGCG 3000
AGGACCGCTT TCGCTGGAGC GCGACGATGA TCGGCCTGTC GCTTGCGGTA 3050
TTCGGAATCT TGCACGCCCT CGCTCAAGCC TTCGTCACTG GTCCCGCCAC 3100
CAAACGTTTC GGCGAGAAGC AGGCCATTAT CGCCGGCATG GCGGCCGACG 3150
CGCTGGGCTA CGTCTTGCTG GCGTTCGCGA CGCGAGGCTG GATGGCCTTC 3200
CCCATTATGA TTCTTCTCGC TTCCGGCGGC ATCGGGATGC CCGCGTTGCA 3250
GGCCATGCTG TCCAGGCAGG TAGATGACGA CCATCAGGGA CAGCTTCAAG 3300
GATCGCTCGC GGCTCTTACC AGCCTAACTT CGATCACTGG ACCGCTGATC 3350
GTCACGGCGA TTTATGCCGC CTCGGCGAGC ACATGGAACG GGTTGGCATG 3400
GATTGTAGGC GCCGCCCTAT ACCTTGTCTG CCTCCCCGCG TTGCGTCGCG 3450
GTGCATGGAG CCGGGCCACC TCGACCTGAA TGGAAGCCGG CGGCACCTCG 3500
CTAACGGATT CACCACTCCA AGAATTGGAG CCAATCAATT CTTGCGGAGA 3550
ACTGTGAATG CGCAAACCAA CCCTTGGCAG AACATATCCA TCGCGTCCGC 3600
CATCTCCAGC AGCCGCACGC GGCGCATCTC GGGCAGCGTT GGGTCCTGGC 3650
CACGGGTGCG CATGATCGTG CTCCTGTCGT TGAGGACCCG GCTAGGCTGG 3700
CGGGGTTGCC TTACTGGTTA GCAGAATGAA TCACCGATAC GCGAGCGAAC 3750
GTGAAGCGAC TGCTGCTGCA AAACGTCTGC GACCTGAGCA ACAACATGAA 3800
TGGTCTTCGG TTTCCGTGTT TCGTAAAGTC TGGAAACGCG GAAGTCAGCG 3850
CCCTGCACCA TTATGTTCCG GATCTGCATC GCAGGATGCT GCTGGCTACC 3900
CTGTGGAACA CCTACATCTG TATTAACGAA GCGCTGGCAT TGACCCTGAG 3950
TGATTTTTCT CTGGTCCCGC CGCATCCATA CCGCCAGTTG TTTACCCTCA 4000
CAACGTTCCA GTAACCGGGC ATGTTCATCA TCAGTAACCC GTATCGTGAG 4050
CATCCTCTCT CGTTTCATCG GTATCATTAC CCCCATGAAC AGAAATTCCC 4100
CCTTACACGG AGGCATCAAG TGACCAAACA GGAAAAAACC GCCCTTAACA 4150
TGGCCCGCTT TATCAGAAGC CAGACATTAA CGCTTCTGGA GAAACTCAAC 4200
GAGCTGGACG CGGATGAACA GGCAGACATC TGTGAATCGC TTCACGACCA 4250
CGCTGATGAG CTTTACCGCA GCTGCCTCGC GCGTTTCGGT GATGACGGTG 4300
AAAACCTCTG ACACATGCAG CTCCCGGAGA CGGTCACAGC TTGTCTGTAA 4350
GCGGATGCCG GGAGCAGACA AGCCCGTCAG GGCGCGTCAG CGGGTGTTGG 4400
CGGGTGTCGG GGCGCAGCCA TGACCCAGTC ACGTAGCGAT AGCGGAGTGT 4450
ATACTGGCTT AACTATGCGG CATCAGAGCA GATTGTACTG AGAGTGCACC 4500
ATATGCGGTG TGAAATACCG CACAGATGCG TAAGGAGAAA ATACCGCATC 4550
AGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 4600
GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA 4650
CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA 4700
AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 4750
CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 4800
GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 4850
CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 4900
CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT 4950
ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA 5000
CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA 5050
GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA 5100
ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG 5150
TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 5200
TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG 5250
GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG 5300
ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC 5350
GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA 5400
TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA 5450
AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 5500
CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT 5550
CATCCATAGT TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG 5600
GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC 5650
ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC 5700
GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT 5750
TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 5800
TGTTGCCATT GCTGCAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG 5850
CTTCATTCAG CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC 5900
ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG 5950
AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA 6000
ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG 6050
TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC 6100
TTGCCCGGCG TCAACACGGG ATAATACCGC GCCACATAGC AGAACTTTAA 6150
AAGTGCTCAT CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC 6200
TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG CACCCAACTG 6250
ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG 6300
GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA 6350
ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 6400
TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 6450
TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA 6500
ACCATTATTA TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC 6550
CTTTCGTCTT CAA 6563

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser
20 25 30

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr
35 40 45

Cys Arg Ser Ser Gln Ser Leu Val His Gly Ile Gly Glu Thr Tyr
50 55 60

Leu His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
65 70 75

Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe
80 85 90

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
95 100 105

Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gln Ser Thr
110 115 120

His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
125 130 135

Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp
140 145 150

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn
155 160 165

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn
170 175 180

Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp
185 190 195

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser
200 205 210

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr
215 220 225

His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly
230 235 240

Glu Cys
242

(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

CATGGTATAG GTTAAACTTA TTTACAC 27

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

CATGGTATAG GTNNSACTTA TTTACAC 27

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 780 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150
AGCTTAGTAC ATGGTATAGG TGAGACGTAT TTACACTGGT ATCAACAGAA 200
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400
TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700
CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 78 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

CTAGTGCAGT CTGGCGGTGG CCTGGTGCAG CCAGGGGGCT CACTCCGTTT 50
GTCCTGTGCA GCTTCTGGCT ACTCCTTC 78

(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 82 base pairs
(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

TCGAGAAGGA GTAGCCAGAA GCTGCACAGG ACAAACGGAG TGAGCCCCCT 50
GGCTGCACCA GGCCACCGCC AGACTGCACT AG 82

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8120 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

TTCGAGCTCG CCCGACATTG ATTATTGACT AGAGTCGATC GACAGCTGTG 50
GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 100
GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 150
TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 200
GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 250
CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 300
TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 350
AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 400
CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTGACGTAA 450
GTACCGCCTA TAGAGCGATA AGAGGATTTT ATCCCCGCTG CCATCATGGT 500
TCGACCATTG AACTGCATCG TCGCCGTGTC CCAAAATATG GGGATTGGCA 550
AGAACGGAGA CCTACCCTGG CCTCCGCTCA GGAACGAGTT CAAGTACTTC 600
CAAAGAATGA CCACAACCTC TTCAGTGGAA GGTAAACAGA ATCTGGTGAT 650
TATGGGTAGG AAAACCTGGT TCTCCATTCC TGAGAAGAAT CGACCTTTAA 700
AGGACAGAAT TAATATAGTT CTCAGTAGAG AACTCAAAGA ACCACCACGA 750
GGAGCTCATT TTCTTGCCAA AAGTTTGGAT GATGCCTTAA GACTTATTGA 800
ACAACCGGAA TTGGCAAGTA AAGTAGACAT GGTTTGGATA GTCGGAGGCA 850
GTTCTGTTTA CCAGGAAGCC ATGAATCAAC CAGGCCACCT TAGACTCTTT 900
GTGACAAGGA TCATGCAGGA ATTTGAAAGT GACACGTTTT TCCCAGAAAT 950
TGATTTGGGG AAATATAAAC CTCTCCCAGA ATACCCAGGC GTCCTCTCTG 1000
AGGTCCAGGA GGAAAAAGGC ATCAAGTATA AGTTTGAAGT CTACGAGAAG 1050
AAAGACTAAC AGGAAGATGC TTTCAAGTTC TCTGCTCCCC TCCTAAAGCT 1100
ATGCATTTTT ATAAGACCAT GGGACTTTTG CTGGCTTTAG ATCCCCTTGG 1150
CTTCGTTAGA ACGCAGCTAC AATTAATACA TAACCTTATG TATCATACAC 1200
ATACGATTTA GGTGACACTA TAGATAACAT CCACTTTGCC TTTCTCTCCA 1250
CAGGTGTCCA CTCCCAGGTC CAACTGCACC TCGGTTCTAT CGATTGAATT 1300
CCACCATGGG ATGGTCATGT ATCATCCTTT TTCTAGTAGC AACTGCAACT 1350
GGAGTACATT CAGAAGTTCA GCTAGTGCAG TCTGGCGGTG GCCTGGTGCA 1400
GCCAGGGGGC TCACTCCGTT TGTCCTGTGC AGCTTCTGGC TACTCCTTCT 1450
CGAGTCACTA TATGCACTGG GTCCGTCAGG CCCCGGGTAA GGGCCTGGAA 1500
TGGGTTGGAT ATATTGATCC TTCCAATGGT GAAACTACGT ATAATCAAAA 1550
GTTCAAGGGC CGTTTCACTT TATCTCGCGA CAACTCCAAA AACACAGCAT 1600
ACCTGCAGAT GAACAGCCTG CGTGCTGAGG ACACTGCCGT CTATTACTGT 1650
GCAAGAGGGG ATTATCGCTA CAATGGTGAC TGGTTCTTCG ACGTCTGGGG 1700
TCAAGGAACC CTGGTCACCG TCTCCTCGGC CTCCACCAAG GGCCCATCGG 1750
TCTTCCCCCT GGCACCCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC 1800
CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA CGGTGTCGTG 1850
GAACTCAGGC GCCCTGACCA GCGGCGTGCA CACCTTCCCG GCTGTCCTAC 1900
AGTCCTCAGG ACTCTACTCC CTCAGCAGCG TGGTGACTGT GCCCTCTAGC 1950
AGCTTGGGCA CCCAGACCTA CATCTGCAAC GTGAATCACA AGCCCAGCAA 2000
CACCAAGGTG GACAAGAAAG TTGAGCCCAA ATCTTGTGAC AAAACTCACA 2050
CATGCCCACC GTGCCCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC 2100
CTCTTCCCCC CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA 2150
GGTCACATGC GTGGTGGTGG ACGTGAGCCA CGAAGACCCT GAGGTCAAGT 2200
TCAACTGGTA CGTGGACGGC GTGGAGGTGC ATAATGCCAA GACAAAGCCG 2250
CGGGAGGAGC AGTACAACAG CACGTACCGT GTGGTCAGCG TCCTCACCGT 2300
CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC AAGGTCTCCA 2350
ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGG 2400
CAGCCCCGAG AACCACAGGT GTACACCCTG CCCCCATCCC GGGAAGAGAT 2450
GACCAAGAAC CAGGTCAGCC TGACCTGCCT GGTCAAAGGC TTCTATCCCA 2500
GCGACATCGC CGTGGAGTGG GAGAGCAATG GGCAGCCGGA GAACAACTAC 2550
AAGACCACGC CTCCCGTGCT GGACTCCGAC GGCTCCTTCT TCCTCTACAG 2600
CAAGCTCACC GTGGACAAGA GCAGGTGGCA GCAGGGGAAC GTCTTCTCAT 2650
GCTCCGTGAT GCATGAGGCT CTGCACAACC ACTACACGCA GAAGAGCCTC 2700
TCCCTGTCTC CGGGTAAATG AGTGCGACGG CCCTAGAGTC GACCTGCAGA 2750
AGCTTGGCCG CCATGGCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 2800
AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG 2850
CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 2900
GATCGATCGG GAATTAATTC GGCGCAGCAC CATGGCCTGA AATAACCTCT 2950
GAAAGAGGAA CTTGGTTAGG TACCTTCTGA GGCGGAAAGA ACCATCTGTG 3000
GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 3050
GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 3100
TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 3150 

5 

GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 3200 
CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 3250 
10 TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 3300 
AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 3350 
CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTCAGGTAA 3400 

15 

GTACCGCCTA TAGAGTCTAT AGGCCCACCC CCTTGGCTTC GTTAGAACGC 3450 
GGCTACAATT AATACATAAC CTTTTGGATC GATCCTACTG ACACTGACAT 3500 
20 CCACTTTTTC TTTTTCTCCA CAGGTGTCCA CTCCCAGGTC CAACTGCACC 3550 
TCGGTTCGCG AAGCTAGCTT GGGCTGCATC GATTGAATTC CACCATGGGA 3600 
TGGTCATGTA TCATCCTTTT TCTAGTAGCA ACTGCAACTG GAGTACATTC 3650 

25 

AGATATCCAG ATGACCCAGT CCCCGAGCTC CCTGTCCGCC TCTGTGGGCG 3700 
ATAGGGTCAC CATCACCTGC AGGTCAAGTC AAAGCTTAGT ACATGGTATA 37 50 
30 GGTGCTACGT ATTTACACTG GTATCAACAG AAACCAGGAA AAGCTCCGAA 3800 
ACTACTGATT TACAAAGTAT CCAATCGATT CTCTGGAGTC CCTTCTCGCT 3850 
TCTCTGGATC CGGTTCTGGG ACGGATTTCA CTCTGACCAT CAGCAGTCTG 3900 

35 

CAGCCAGAAG ACTTCGCAAC TTATTACTGT TCACAGAGTA CTCATGTCCC 3950 
GCTCACGTTT GGACAGGGTA CCAAGGTGGA GATCAAACGA ACTGTGGCTG 4000 
40 CACCATCTGT CTTCATCTTC CCGCCATCTG ATGAGCAGTT GAAATCTGGA 4050 
ACTGCTTCTG TTGTGTGCCT GCTGAATAAC TTCTATCCCA GAGAGGCCAA 4100 
AGTACAGTGG AAGGTGGATA ACGCCCTCCA ATCGGGTAAC TCCCAGGAGA 4150 

45 

GTGTCACAGA GCAGGACAGC AAGGACAGCA CCTACAGCCT CAGCAGCACC 4200 
CTGACGCTGA GCAAAGCAGA CTACGAGAAA CACAAAGTCT ACGCCTGCGA 42 50 
50 AGTCACCCAT CAGGGCCTGA GCTCGCCCGT CACAAAGAGC TTCAACAGGG 4300 
GAGAGTGTTA AGCTTGGCCG CCATGGCCCA ACTTGTTTAT TGCAGCTTAT 43 50 
AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT 4400 

55 

TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT 4450 
ATCATGTCTG GATCGATCGG GAATTAATTC GGCGCAGCAC CATGGCCTGA 4500
AATAACCTCT GAAAGAGGAA CTTGGTTAGG TACCTTCTGA GGCGGAAAGA 4550 

ACCAGCTGTG GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC 4600 
CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG 4650 
GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC 4700
ATCTCAATTA GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG 4750 
CCCCTAACTC CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT 4800 

TTTTTTTATT TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC 4850 
AGAAGTAGTG AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTG 4900 
TTACCTCGAG CGGCCGCTTA ATTAAGGCGC GCCATTTAAA TCCTGCAGGT 4950
AACAGCTTGG CACTGGCCGT CGTTTTACAA CGTCGTGACT GGGAAAACCC 5000 
TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCC TTCGCCAGCT 5050 

GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGT 5100 
AGCCTGAATG GCGAATGGCG CCTGATGCGG TATTTTCTCC TTACGCATCT 5150 
GTGCGGTATT TCACACCGCA TACGTCAAAG CAACCATAGT ACGCGCCCTG 5200
TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG 5250 
CTACACTTGC CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC 5300 

TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT 5350
CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC 5400 
TTGATTTGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT 5450
TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT 5500 
CCAAACTGGA ACAACACTCA ACCCTATCTC GGGCTATTCT TTTGATTTAT 5550 

AAGGGATTTT GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA 5600 
CAAAAATTTA ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTTATG 5650 
GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAACTCC 5700
GCTATCGCTA CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC 5750 
CCGCTGACGC GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA 5800 

CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT TTTCACCGTC 5850 
ATCACCGAAA CGCGCGAGGC AGTATTCTTG AAGACGAAAG GGCCTCGTGA 5900
TACGCCTATT TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG 5950 

TCAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT 6000 
TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT 6050 
AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT TCAACATTTC 6100
CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 6150 
TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG 6200

CACGAGTGGG TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG 6250 
AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT 6300 
GCTATGTGGC GCGGTATTAT CCCGTGATGA CGCCGGGCAA GAGCAACTCG 6350
GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA CTCACCAGTC 6400 
ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 6450 

TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA 6500 
TCGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT 6550 
GTAACTCGCC TTGATCGTTG GGAACCGGAG CTGAATGAAG CCATACCAAA 6600
CGACGAGCGT GACACCACGA TGCCAGCAGC AATGGCAACA ACGTTGCGCA 6650 
AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA 6700 

GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 6750 
TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT 6800 
CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC 6850
GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG 6900 
ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAGCATTGG TAACTGTCAG 6950 

ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA 7000 
TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 7050 
CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA 7100
TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG 7150 
CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA 7200 

GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC 7250 
CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 7300
TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 7350 

TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT 7400 
AGTTACCGGA TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA 7450 
CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT ACCTACAGCG 7500
TGAGCATTGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 7550 
ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 7600 

GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 7650 
ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA 7700 
AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT 7750
TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG TGGATAACCG 7800
TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG 7850
AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AATACGCAAA 7900
CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATCCAGCT GGCACGACAG 7950
GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT 8000
ACCTCACTCA TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT 8050
ATGTTGTGTG GAATTGTGAG CGGATAACAA TTTCACACAG GAAACAGCTA 8100
TGACCATGAT TACGAATTAA 8120
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 800 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 50 

TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 100 
AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 150 
CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 200
GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250
TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300
TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 400 
ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 450 

GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500 
ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 550 
TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 650 
CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700 

AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750 

AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCTGA 800 

(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids

(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Gln Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
245 250 255

Pro
256

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 452 amino acids

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Glu Val Gln Leu Val Gln Ser Gly Gly Gly Leu Val Gln Pro Gly
1 5 10 15

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
35 40 45

Glu Trp Val Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Arg Phe Thr Leu Ser Arg Asp Asn Ser
65 70 75

Lys Asn Thr Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp
80 85 90

Thr Ala Val Tyr Tyr Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly
95 100 105

Asp Trp Phe Phe Asp Val Trp Gly Gln Gly Thr Leu Val Thr Val
110 115 120

Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro
125 130 135

Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu
140 145 150

Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
155 160 165

Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln
170 175 180

Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
185 190 195

Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
200 205 210

Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
215 220 225

Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu
230 235 240

Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
245 250 255

Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp
260 265 270

Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp
275 280 285

Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
290 295 300

Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
305 310 315

Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
320 325 330

Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys
335 340 345

Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg
350 355 360

Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
365 370 375

Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly
380 385 390

Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
395 400 405

Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
410 415 420

Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
425 430 435

Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
440 445 450

Gly Lys
452

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 219 amino acids

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
1 5 10 15

Gly Asp Arg Val Thr Ile Thr Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Ala Thr Tyr Leu His Trp Tyr Gln Gln Lys Pro
35 40 45

Gly Lys Ala Pro Lys Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe
50 55 60

Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp
65 70 75

Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr
80 85 90

Tyr Tyr Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly Gln
95 100 105

Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val
110 115 120

Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly Thr Ala
125 130 135

Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys
140 145 150

Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
155 160 165

Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu
170 175 180

Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
185 190 195

Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val
200 205 210

Thr Lys Ser Phe Asn Arg Gly Glu Cys
215 219